Extending an interoperable platform to facilitate the creation of multilingual and multimodal NLP applications

Authors

  • Georgios Kontonatsios
  • Paul Thompson
  • Riza Theresa Batista-Navarro
  • Claudiu Mihaila
  • Ioannis Korkontzelos
  • Sophia Ananiadou
Abstract

U-Compare is a UIMA-based workflow construction platform for building natural language processing (NLP) applications from heterogeneous language resources (LRs) without the need for programming skills. U-Compare has been adopted within the context of the META-NET Network of Excellence, and over 40 LRs that process 15 European languages have been added to the U-Compare component library. In line with META-NET's aim of increasing communication between citizens of different European countries, U-Compare has been extended to facilitate the development of a wider range of applications, including both multilingual and multimodal workflows. The enhancements exploit the UIMA Subject of Analysis (Sofa) mechanism, which allows different facets of the input data to be represented. We demonstrate how our customised extensions to U-Compare allow the construction and testing of NLP applications that transform the input data in different ways, e.g., machine translation, automatic summarisation and text-to-speech.
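
As a rough illustration of the idea behind Sofa-style views, the following Python sketch models a single document that holds several named representations of the same input, with each workflow step reading one view and writing another. This is not the UIMA or U-Compare API: `MultiViewDocument`, `run_component`, `toy_translate` and `toy_speech` are hypothetical names invented for this example, and the "translation" and "speech" steps are trivial placeholders.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class MultiViewDocument:
    """Toy stand-in for a multi-view document (hypothetical, not the UIMA CAS)."""
    views: Dict[str, object] = field(default_factory=dict)

    def create_view(self, name: str, data: object) -> None:
        # Each view is created once and holds one facet of the input.
        if name in self.views:
            raise ValueError(f"view {name!r} already exists")
        self.views[name] = data

    def get_view(self, name: str) -> object:
        return self.views[name]


def run_component(doc: MultiViewDocument, source: str, target: str,
                  transform: Callable[[object], object]) -> None:
    """Read the source view, transform it, and store the result as a new view."""
    doc.create_view(target, transform(doc.get_view(source)))


def toy_translate(text: str) -> str:
    # Placeholder for a machine translation component: word-by-word lookup.
    lexicon = {"hello": "hola", "world": "mundo"}
    return " ".join(lexicon.get(word, word) for word in text.lower().split())


def toy_speech(text: str) -> bytes:
    # Placeholder for a text-to-speech component: bytes stand in for audio.
    return text.encode("utf-8")


# A two-step workflow: text -> translated text -> "audio".
doc = MultiViewDocument()
doc.create_view("source_text", "Hello world")
run_component(doc, "source_text", "translated_text", toy_translate)
run_component(doc, "translated_text", "audio", toy_speech)
```

The point of the sketch is only that the original text, its translation and the derived audio coexist in one document under different view names, so later components can pick whichever facet they need.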

Similar resources

Discovery and registration of components in multimodal systems distributed on the IoT

One of the major gaps in the current HTML5 web platform is the lack of interoperable means for an application to discover services and applications available in a given space and network. This problem is shared by the multimodal applications developed with web technologies, for example, in smart houses or applications for the Internet of Things. To address this gap, we produced a SOA approach f...

MultiNews: A Web collection of an Aligned Multimodal and Multilingual Corpus

Integrating Natural Language Processing (NLP) and computer vision is a promising effort. However, the applicability of these methods depends directly on the availability of specific multimodal data that includes images and texts. In this paper, we present a multimodal corpus of comparable documents and their images in 9 languages, collected from the web news articles of the Euronews website...

Integrating NLP Using Linked Data

We are currently observing a plethora of Natural Language Processing tools and services being made available. Each of the tools and services has its particular strengths and weaknesses, but exploiting the strengths and synergistically combining different tools is currently an extremely cumbersome and time consuming task. Also, once a particular set of tools is integrated, this integration is no...

Generating Dialogue Applications with the GEMINI Platform

Within the EC-funded research project GEMINI (Generic Environment for Multilingual Interactive Natural Interfaces) we aim at the development of a platform that assists the user in semi-automatically producing interactive multilingual and multimodal dialogue interfaces to databases. To demonstrate the platform's efficiency, two different applications were generated using this platform. Main feature...

NIF: An ontology-based and linked-data-aware NLP Interchange Format

We are currently observing a plethora of Natural Language Processing tools and services being made available. Each of the tools and services has its particular strengths and weaknesses, but exploiting the strengths and synergistically combining different tools is currently an extremely cumbersome and time consuming task. Also, once a particular set of tools is integrated this integration is not...

Publication date: 2013